SEQUENCE LISTING 



<110> Lim, Moon Young 

Edwards, Cynthia A. 
Fry, Kirk E. 
Bruice, Thomas W. 
Starr, Douglas B. 
Laurance, Megan E. 
Kwok, Yan 



<120> DNA Binding Compound-Mediated Molecular 
Switch System 

<130> 4600-0130.30 

<140> US 09/518,297 
<141> 2000-03-03 

<150> US 60/122,513 
<151> 1999-03-03 

<150> US 60/154,605 
<151> 1999-09-17 

<160> 77 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DNA response element 

<400> 1 
cgttcgcact t 

<210> 2 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DNA response element 
<400> 2 

cggagtactg tcctccg 

<210> 3 
<211> 12 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> DNA response element 

<221> misc_feature 
<222> (1) . . . (12) 
<223> n = A,T,C or G 

<400> 3 
taattanggg ng 

<210> 4 
<211> 551 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> VARIANT 
<222> (0) . . . (0) 

<223> transcriptional regulatory protein 
<400> 4 





Glu 


Leu 


Phe 


Pro 


Leu 


He 


Phe 


Pro 


Ala 


Glu 


Pro 


Ala 


Gin 


Ala 


1 






5 










10 










15 




o c x. ui y 






Val 


Glu 


He 


He 


Glu 


Gin 


Pro 


Lys 


Gin Arg 


Gly Met 






20 










25 










30 






Arg Phe 


Arg 
35 




Lys 


Cys 


Glu 


Gly 
40 


Arg 


Ser 


Ala 


Gly 


Ser 
45 


He 


Pro 


Gly 




Ser 


X I1X. 


lien 

AO 


Thr 


Thr 


Lys 


Thr 


His 


Pro 


Thr 


He 


Lys 


He 


Asn 


50 










55 










60 










oi y iyi 




ox y 


Ir X. <J 


vjx y 


X I IX. 


VaX 




Tip 


OCX. 


Leu 


Val 


Thr 


Lys 


Asp 


65 








70 










75 










80 


Pro Pro 


His 


Arg 


Pro 
85 


His 


Pro 


His 


Glu 


Leu 
90 


Val 


Gly 


Lys 


Asp 


Cys 
95 


Arg 


Asp Gly 


Phe 


Tyr 
100 


Glu 


Ala 


Glu 


Leu 


Cys 
105 


Pro 


Asp 


Arg 


Cys 


He 
110 


His 


Ser 


Phe Gin 


Asn 


Leu 


Gly 


He 


Gin 


Cys 


Val 


Lys 


Lys 


Arg Asp 


Leu 


Glu 


Gin 




115 










120 










125 








Ala He 


Ser 


Gin 


Arg 


He 


Gin 


Thr 


Asn 


Asn 


Asn 


Pro 


Phe 


Gin 


Val 


Pro 


130 










135 










140 










He Glu 


Glu 


Gin 


Arg 


Gly 


Asp 


Tyr 


Asp 


Leu 


Asn 


Ala 


Val 


Arg 


Leu 


Cys 


145 








150 










155 










160 


Phe Gin 


Val 


Thr 


Val 
165 


Arg 


Asp 


Pro 


Ser 


Gly 
170 


Arg 


Pro 


Leu 


Arg 


Leu 
175 


Pro 


Pro Val 


Leu 


Pro 
180 


His 


Pro 


He 


Phe 


Asp 
185 


Asn 


Arg 


Ala 


Pro 


Asn 
190 


Thr 


Ala 


Glu Leu 


Lys 
195 


He 


Cys 


Arg 


Val 


Asn 
200 


Arg 


Asn 


Ser 


Gly 


Ser 
205 


Cys 


Leu 


Gly 


Gly Asp 


Glu 


He 


Phe 


Leu 


Leu 


Cys 


Asp 


Lys 


Val 


Gin 


Lys 


Glu Asp 


He 


210 










215 










220 










Glu Val 


Tyr 


Phe 


Thr 


Gly 


Pro 


Gly 


Trp 


Glu 


Ala 


Arg 


Gly 


Ser 


Phe 


Ser 


225 








230 










235 










240 


Gin Ala 


Asp 


Val 


His 


Arg 


Gin 


Val 


Ala 


He 


Val 


Phe Arg 


Thr 


Pro 


Pro 








245 










250 










255 




Tyr Ala 


Asp 


Pro 


Ser 


Leu 


Gin 


Ala 


Pro 


Val 


Arg 


Val 


Ser 


Met 


Gin 


Leu 




260 










265 










270 






Arg Arg 


Pro 
275 


Ser 


Asp 


Arg 


Glu 


Leu 
280 


Ser 


Glu 


Pro 


Met 


Glu 
285 


Phe 


Gin 


Tyr 



Leu Pro Asp Thr Asp Asp Arg His Arg lie Glu Glu Lys Arg Lys Arg 

290 295 300 

Thr Tyr Glu Thr Phe Lys Ser lie Met Lys Lys Ser Pro Phe Ser Gly 
305 310 315 320 

Pro Thr Asp Pro Arg Pro Pro Pro Arg Arg lie Ala Val Pro Ser Arg 

325 330 335 

Ser Ser Ala Ser Val Pro Lys Pro Ala Pro Gin Pro Tyr Pro Phe Thr 

340 345 350 

Ser Ser Leu Ser Thr lie Asn Tyr Asp Glu Phe Pro Thr Met Val Phe 

355 360 365 

Pro Ser Gly Gin lie Ser Gin Ala Ser Ala Leu Ala Pro Ala Pro Pro 

370 375 380 

Gin Val Leu Pro Gin Ala Pro Ala Pro Ala Pro Ala Pro Ala Met Val 
385 390 395 400 

Ser Ala Leu Ala Gin Ala Pro Ala Pro Val Pro Val Leu Ala Pro Gly 

405 410 415 

Pro Pro Gin Ala Val Ala Pro Pro Ala Pro Lys Pro Thr Gin Ala Gly 

420 425 430 

Glu Gly Thr Leu Ser Glu Ala Leu Leu Gin Leu Gin Phe Asp Asp Glu 

435 440 445 

Asp Leu Gly Ala Leu Leu Gly Asn Ser Thr Asp Pro Ala Val Phe Thr 

450 455 460 

Asp Leu Ala Ser Val Asp Asn Ser Glu Phe Gin Gin Leu Leu Asn Gin 
465 470 475 480 

Gly lie Pro Val Ala Pro His Thr Thr Glu Pro Met Leu Met Glu Tyr 

485 490 495 

Pro Glu Ala lie Thr Arg Leu Val Thr Gly Ala Gin Arg Pro Pro Asp 

500 505 510 

Pro Ala Pro Ala Pro Leu Gly Ala Pro Gly Leu Pro Asn Gly Leu Leu 

515 520 525 

Ser Gly Asp Glu Asp Phe Ser Ser lie Ala Asp Met Asp Phe Ser Ala 

530 535 540 

Leu Leu Ser Gin lie Ser Ser 
545 550 

<210> 5 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DNA response element 
<400> 5 

tccctatcag tgatagaga 19 

<210> 6 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> response element 
<400> 6 

cttaacactc gcgagtgtta ag 22 



3 



<210> 7 
<211> 13 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> response element 

<221> misc_feature 
<222> (3) . . . (3) 
<223> n = G or T 

<221> misc_feature 
<222> (7) . . . (7) 
<223> n = A,T,C or G 

<221> misc_feature 
<222> (12) . . . (12) 
<223> n = A or C 

<400> 7 

rgntcantga cny 

<210> 8 
<211> 77 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> activator sequence 
<400> 8 

Ala Pro. Pro Thr Asp Val Ser Leu Gly Asp Glu Leu His Leu Asp Gly 

15 10 15 

Glu Asp Val Ala Met Ala His Ala Asp Ala Leu Asp Asp Phe Asp Leu 

20 25 30 

Asp Met Leu Gly Asp Gly Asp Ser Pro Gly Pro Gly Phe Thr Pro His 

35 40 45 

Asp Ser Ala Pro Tyr Gly Ala Leu Asp Met Ala Asp Phe Glu Phe Glu 

50 55 60 

Gin Met Phe Thr Asp Ala Leu Gly lie Asp Glu Tyr Gly 
65 70 75 

<210> 9 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> activator sequence 

<221> VARIANT 
<222> (1) . . . (11) 
<223> tetramer 



<400> 9 

Asp Ala Leu Asp Asp Phe Asp Leu Asp Met Leu 



10 



<210> 10 
<211> 97 
<212> PRT 
<213> Artificia. 

<220> 

<223> repressor 



Sequence 



sequence 



<400> 10 


























Met Asp 


Ala 


Lys 


Ser 


Leu Thr 


Ala 


Trp 


Ser 


Arg 


Thr 


Leu 


Val 


Thr 


Phe 


1 






5 








10 










15 




Lys Asp 


Val 


Phe 


Val 


Asp Phe 


Thr Arg 


Glu 


Glu 


Trp 


Lys 


Leu 


Leu 


Asp 






20 








25 










30 






Thr Ala 


Gin 


Gin 


He 


Val Tyr 


Arg 


Asn 


Val 


Met 


Leu 


Glu 


Asn 


Tyr 


Lys 




35 








40 










45 








Asn Leu 


Val 


Ser 


Leu 


Gly Tyr 


Gin 


Leu 


Thr 


Lys 


Pro Asp 


Val 


He 


Leu 


50 








55 










60 










Arg Leu 


Glu 


Lys 


Gly 


Glu Glu 


Pro 


Trp 


Leu 


Val 


Glu 


Arg 


Glu 


He 


His 


65 








70 








75 










80 


Gin Glu 


Thr 


His 


Pro Asp Ser 


Glu 


Thr 


Ala 


Phe 


Glu 


He 


Lys 


Ser 


Ser 








85 








90 










95 





Val 



<210> 11 
<211> 36 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> repressor sequence 
<400> 11 

Met Ala Ala Ala Val Arg Met Asn He Gin Met Leu Leu Glu Ala Ala 

15 10 15 

Asp Tyr Leu Glu Arg Arg Glu Arg Glu Ala Glu His Gly Tyr Ala Ser 

20 25 30 

Met Leu Pro Tyr 
35 

<210> 12 
<211> 116 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> misc_f eature 
<222> (0) . . . (0) 

<223> partial promoter sequence 
<400> 12 

cgcggtcaga aaattatttt aaatttcctc ttgtcaggcc ggaataactc cctataatgc 60 
gccaccactg acacggaaca acggcaaaca cgccgccggg tcagcggggt tctcct 116 

<210> 13 



5 



<211> 22 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> misc_f eature 
<222> (0) . . . (0) 

<223> partial promoter sequence 
<400> 13 

agaaaattat tttaaatttc ct 

<210> 14 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> modified promoter sequence 

<400> 14 

gactgcagtg gtacctagga gg 

<210> 15 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> modified promoter sequence 

<400> 15 

agaaaattat tttaaatttc ct 

<210> 16 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> modified promoter sequence 
<400> 16 

ggaaaatttt ttttcaaaag ta 

<210> 17 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> modified promoter sequence 
<400> 17 

tgaaatttat tttgcgaaag gg 



<210> 18 



<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered DNA response element 
<400> 18 

tgttcgcact t 11 

<210> 19 
<211> 52 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered DNA response element 
<400> 19 

catggacgcc actgagccgt ttttgttcgc acttgaggcg agtcgatgca cc 52 

<210> 20 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered DNA response element 
<400> 20 

catggacgcc actgagccgt gttcgcactt ttttttgagg cgagtcgatg cacc 54 

<210> 21 
<211> 58 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered DNA response element 
<400> 21 

catggacgcc actgagccgt ttttgttcgc actttttttt gaggcgagtc gatgcacc 58 

<210> 22 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered DNA response element 
<400> 22 

cttaaaaata ac 12 

<210> 23 
<211> 16 
<212> DNA 



7 



<213> Artificial Sequence 



<220> 

<223> engineered DNA response element 
<400> 23 

ttgaaaaatc aacgct 16 

<210> 24 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered DNA response element 
<400> 24 

tttttgttcg cacttttttt t 21 

<210> 25 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered DNA response element 
<400> 25 

tttttgggat tttccttttt 20 

<210> 26 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered DNA response element 
<400> 26 

aaaaaattgt gagcgctcac aatttttt 2 8 

<210> 27 
<211> 6 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> tissue-specific transcription factor 
<400> 27 

acttta 6 

<210> 28 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



8 



<220> 

<223> engineered DNA response element 



<400> 28 
taccgacat 



<210> 29 
<211> 10 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> engineered DNA response element 



<400> 29 
gggactttcc 



<210> 30 

<211> 10 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> engineered DNA response element 



<400> 30 
gggattttcc 

<210> 31 
<211> 50 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> engineered DNA response element 
<400> 31 

cgaccgtgct cgagttaacg ggactttcca aaaacgatcg gactggact 

<210> 32 
<211> 50 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered DNA response element 
<400> 32 

cgaccgtgct cgagttaacg ggattttcca aaaacgatcg gactggact 

<210> 33 
<211> 50 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered DNA response element 



<400> 33 

cgaccgtgct cgagaaattg ggattttcca aaaacgatcg gactggactc 50 

<210> 34 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered DNA response element 

<400> 34 

aaaaaattgt gagcgctcac aatttttt 28 

<210> 35 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered DNA response element 
<400> 35 

ttttttttgt gagcggataa caaaa 25 

<210> 36 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered DNA response element 



<400> 36 
tctgggatcc 

<210> 37 
<211> 14 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> engineered DNA response element 

<400> 37 
gagttttttt taag 

<210> 38 
<211> 14 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered DNA response element 



<400> 38 



10 



gagttttaaa agag 



14 



<210> 39 
<211> 969 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> VARIANT 
<222> (0) . . . (0) 

<223> transcriptional regulatory protein 



<400> 39 



Met 


Ala 


Glu 


Asp 


Asp 


Pro 


Tvr 


Leu 


Gly 


Arg 


Pro 


Glu 


Gin 


Met 


Phe 


His 


1 








5 










10 










15 




Leu 


Asp 


Pro 


Ser 


Leu 


Thr 


His 


Thr 


He 


Phe 


Asn 


Pro 


Glu 


Val 


Phe 


Gin 








20 










25 










30 






Pro 


Gin 


Met 


Ala 


Leu 


Pro 


Thr 


Ala 


Asp 


Gly 


Pro 


Tyr 


Leu 


Gin 


He 


Leu 






35 










40 










45 








Glu 


Gin 


Pro 


Lvs 


Gin 


Ara 


Gly 


Phe 


Arg 


Phe 


Arg 


Tyr 


Val 


Cys 


Glu 


Gly 




50 










55 










60 










Pro 


Ser 


His 


Gly 


Gly 


Leu 


Pro 


Gly 


Ala 


Ser 


Ser 


Glu 


Lys 


Asn 


Lys 


Lys 


65 










70 










75 










80 


Ser 


Tvr 


Pro 


Gin 


Val 


Lvs 


He 


Cys 


Asn 


Tyr 


Val 


Gly 


Pro 


Ala 


Lys 


Val 










85 










90 










95 




He 


Val 


Gin 


Leu 


Val 


Thr 


Asn 


Gly 


Lvs 


Asn 


He 


His 


Leu 


His 


Ala 


His 








100 










105 










110 






Ser 


Leu 


Val 


Gly 


Lvs 


His 


Cys 


Glu 


Asp 


Gly 


He 


Cys 


Thr 


Val 


Thr 


Ala 






115 










120 










125 








Gly 


Pro 


Lvs 


Asp 


Met 


Val 


Val 


Gly 


Phe 


Ala 


Asn 


Leu 


Gly 


He 


Leu 


His 




130 










135 










140 










Val 


Thr 


Lvs 


Lys 


Lvs 

Jt 


Val 


Phe 


Glu 


Thr 


Leu 


Glu 


Ala 


Arg 


Met 


Thr 


Glu 


145 










150 










155 










160 


Ala 


Cys 


He 


Arg 


Gly 


Tvr 


Asn 


Pro 


Gly 


Leu 


Leu 


Val 


His 


Pro Asp 


Leu 










165 










170 










175 




Ala 


Tyr 


Leu 


Gin 


Ala 


Glu 


Glv 


Gly 


Gly 


Asp 


Arg 


Gin 


Leu 


Gly Asp Arg 








180 










185 










190 






Glu 


Lys 


Glu 


Leu 


He 


Arg 


Gin 


Ala 


Ala 


Leu 


Gin 


Gin 


Thr 


Lys 


Glu 


Met 






195 










200 










205 








Asp 


Leu 


Ser 


Val 


Val 


Arg 


Leu 


Met 


Phe 


Thr 


Ala 


Phe 


Leu 


Pro 


Asp 


Ser 




210 










215 










220 










Thr 


Gly 


Ser 


Phe 


Thr 


Arg 


Arg 


Leu 


Glu 


Pro 


Val 


Val 


Ser 


Asp 


Ala 


He 


225 










230 










235 










240 


Tyr Asp 


Ser 


Lys 


Ala 


Pro 


Asn 


Ala 


Ser 


Asn 


Leu 


Lys 


He 


Val 


Arg 


Met 










245 










250 










255 




Asp Arg 


Thr 


Ala 


Gly 


Cys 


Val 


Thr 


Gly 


Gly 


Glu 


Glu 


He 


Tyr 


Leu 


Leu 








260 










265 










270 






Cys 


Asp 


Lys 


Val 


Gin 


Lys 


Asp 


Asp 


He 


Gin 


He 


Arg 


Phe 


Tyr 


Glu 


Glu 






275 










280 










285 








Glu 


Glu 


Asn 


Gly 


Gly 


Val 


Trp 


Glu 


Gly 


Phe 


Gly Asp 


Phe 


Ser 


Pro 


Thr 




290 










295 










300 










Asp 


Val 


His 


Arg 


Gin 


Phe 


Ala 


He 


Val 


Phe 


Lys 


Thr 


Pro 


Lys 


Tyr 


Lys 


305 










310 










315 










320 


Asp 


He 


Asn 


He 


Thr 


Lys 


Pro 


Ala 


Ser 


Val 


Phe 


Val 


Gin 


Leu Arg Arg 








325 










330 










335 




Lys 


Ser 


Asp 


Leu 


Glu 


Thr 


Ser 


Glu 


Pro 


Lys 


Pro 


Phe 


Leu 


Tyr 


Tyr 


Pro 






340 










345 










350 
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Glu lie Lys Asp 
355 

Pro Asn Phe Ser 
370 

Gly Gly Gly Met 
385 

Thr Gly Pro Gly 

Gly He Thr Phe 
420 

His Gly Thr Met 
435 

Lys Ser Asp Asp 
450 

Thr Thr Glu Gin 
465 

Glu Val Thr Leu 

Val Gin Asp Asn 
500 

His Ala Asn Ala 
515 

Leu Leu Ala Val 
530 

Asp Ser Val Leu 
545 

Arg Asp Leu Leu 

Asn Met Arg Asn 
580 

Thr Lys Gin Glu 
595 

Leu Ser Leu Leu 
610 

Lys Glu Gly His 
625 

Ala Ala Leu Leu 

His Leu Ala Met 
660 

Ala Ala Gly Ala 
675 

Ala Leu His Leu 
690 

Leu Leu Leu Glu 
705 

Thr Thr Pro Leu 

Ala Leu Leu Lys 
740 

Pro Leu Tyr Asp 
755 

Gly Val Val Pro 
770 

Val Phe Asp He 
785 

Asp Asp Leu Leu 



Lys Glu Glu Val 
360 

Asp Ser Phe Gly 
375 

Phe Gly Ser Gly 
390 

Tyr Ser Phe Pro 
405 

His Pro Gly Thr 

Asp Thr Glu Ser 
440 

Lys Asn Thr Val 
455 

Asp Gin Glu Pro 
470 

Thr Tyr Ala Thr 
485 

Leu Phe Leu Glu 

Leu Phe Asp Tyr 
520 

Gin Arg His Leu 
535 

His Leu Ala He 
550 

Glu Val Thr Ser 
565 

Asp Leu Tyr Gin 

Asp Val Val Glu 
600 

Asp Arg Leu Gly 
615 

Asp Lys Val Leu 
630 

Leu Asp His Pro 
645 

Met Ser Asn Ser 

Asp Val Asn Ala 
680 

Ala Val Glu His 
695 

Gly Asp Ala His 
710 

His He Ala Ala 
725 

Ala Ala Gly Ala 

Leu Asp Asp Ser 
760 

Gly Thr Thr Pro 
775 

Leu Asn Gly Lys 
790 

Ala Gin Gly Asp 



Gin Arg Lys Arg 

Gly Gly Ser Gly 
380 

Gly Gly Gly Gly 
395 

His Tyr Gly Phe 
410 

Thr Lys Ser Asn 
425 

Lys Lys Asp Pro 

Asn Leu Phe Gly 
460 

Ser Glu Ala Thr 
475 

Gly Thr Lys Glu 
490 

Lys Ala Met Gin 
505 

Ala Val Thr Gly 

Thr Ala Val Gin 
540 

He His Leu His 
555 

Gly Leu He Ser 
570 

Thr Pro Leu His 
585 

Asp Leu Leu Arg 

Asn Ser Val Leu 
620 

Ser He Leu Leu 
635 

Asn Gly Asp Gly 
650 

Leu Pro Cys Leu 
665 

Gin Glu Gin Lys 

Asp Asn He Ser 
700 

Val Asp Ser Thr 
715 

Gly Arg Gly Ser 
730 

Asp Pro Leu Val 
745 

Trp Glu Asn Ala 

Leu Asp Met Ala 
780 

Pro Tyr Glu Pro 
795 

Met Lys Gin Leu 



Gin Lys Leu Met 
365 

Ala Gly Ala Gly 

Gly Thr Gly Ser 
400 

Pro Thr Tyr Gly 
415 

Ala Gly Met Lys 
430 

Glu Gly Cys Asp 
445 

Lys Val He Glu 

Val Gly Asn Gly 
480 

Glu Ser Ala Gly 
495 

Leu Ala Lys Arg 
510 

Asp Val Lys Met 
525 

Asp Glu Asn Gly 

Ser Gin Leu Val 
560 

Asp Asp He He 
575 

Leu Ala Val He 
590 

Ala Gly Ala Asp 
605 

His Leu Ala Ala 

Lys His Lys Lys 
640 

Leu Asn Ala lie 
655 

Leu Leu Leu Val 
670 

Ser Gly Arg Thr 
685 

Leu Ala Gly Cys 

Thr Tyr Asp Gly 
720 

Thr Arg Leu Ala 
735 

Glu Asn Phe Glu 
750 

Gly Glu Asp Glu 
765 

Thr Ser Trp Gin 

Glu Phe Thr Ser 
800 

Ala Glu Asp Val 
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805 810 815 



Lys Leu 


Gin 


Leu 


Tyr 


Lvs 


Leu 


Leu 


Glu 


He 


Pro 


Asp 


Pro 


Asp 


Lvs 


Asn 






820 










825 










830 






Trp Ala 


Thr 


Leu 


Ala 


Gin 


Lvs 


Leu 


Glv 


Leu 


Glv 


lie 


Leu 


Asn 


Asn 


Ala 




835 










840 










845 








Phe Arg 


Leu 


Ser 


Pro 


Ala 


Pro 


Ser 


Lvs 


Thr 


Leu 


Met 


Asp 


Asn 


Tvr 


Glu 


850 










855 










860 










Val Ser 


Glv 


Gly Thr 


Val 


Arg 


Glu 


Leu 


Val 


Glu 


Ala 


Leu Arg 


Gin 


Met 


865 








870 










875 










880 


Glv Tvr 


Thr 


Glu 


Ala 


He 


Glu 


Val 


He 


Gin 


Ala 


Ala 


Ser 


Ser 


Pro 


Val 








885 










890 










895 




Lys Thr 


Thr 


Ser 


Gin 


Ala 


His 


Ser 


Leu 


Pro 


Leu 


Ser 


Pro 


Ala 


Ser 


Thr 






900 










905 










910 






Arg Gin 


Gin 


He 


Asp 


Glu 


Leu 


Arg 


Asp 


Ser 


Asp 


Ser 


Val 


Cys 


Asp 


Thr 




915 










920 










925 








Gly Val 


Glu 


Thr 


Ser 


Phe 


Arg 


Lys 


Leu 


Ser 


Phe 


Thr 


Glu 


Ser 


Leu 


Thr 


930 










935 










940 










Ser Gly 


Ala 


Ser 


Leu 


Leu 


Thr 


Leu 


Asn 


Lys 


Met 


Pro 


His 


Asp 


Tyr 


Gly 


945 








950 










955 










960 


Gin Glu 


Gly 


Pro 


Leu 


Glu 


Gly 


Lys 


He 

















965 



<210> 40 
<211> 96 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered regulatory sequence 
<400> 40 

gctagccccg ccccgttgac gcaaatgggc ggtaggcgtg tacggtggga ggtttatata 60 
agcagagctc gtttagtgaa ccgtcagatc agatct 96 

<210> 41 
<211> 154 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered regulatory sequence 



<400> 41 

gctagcgccc aaattgggat tttccaaaaa gccgaaattg ggattttcca aaaaccgccg 60 

atcgcccgcc ccgttgacgc aaatgggcgg taggcgtgta cggtgggagg tttatataag 120 

cagagctcgt ttagtgaacc gtcagatcag atct 154 



<210> 42 
<211> 212 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered regulatory sequence 
<400> 42 

acgcgtgccc aaattgggat tttccaaaaa gccgaaattg ggattttcca aaaaccgcgc 60 
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tagcgcccaa attgggattt tccaaaaagc cgaaattggg attttccaaa aaccgccgat 120 
cgcccgcccc gttgacgcaa atgggcggta ggcgtgtacg gtgggaggtt tatataagca 180 
gagctcgttt agtgaaccgt cagatcagat ct 212 



<210> 43 
<211> 96 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered regulatory sequence 
<400> 43 

gctagccccg ccccgttgac gcaaatgggc ggtaggcgtg tacggtggga ggtctatata 60 
agcagagctc gtttagtgaa ccgtcagatc agatct 96 

<210> 44 
<211> 154 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered regulatory sequence 



<400> 44 

gctagcgccc aggtcgggat tttccgagga gccgaggtcg ggattttccg aggaccgccg 60 

atcgcccgcc ccgttgacgc aaatgggcgg taggcgtgta cggtgggagg cctatataag 120 

cagagctcgt ttagtgaacc gtcagatcag atct 154 



<210> 45 
<211> 154 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered regulatory sequence 



<400> 45 

gctagcgccc aggtcgggat tttccgagga gccgaggtcg ggattttccg aggaccgccg 60 

atcgcccgcc ccgttgacgc aaatgggcgg taggcgtgta cggtgggagg cctatataag 120 

cagagctcgt ttagtgaacc gtcagatcag atct 154 



<210> 46 
<211> 762 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered promoter construct 



<400> 46 

ggtacctcaa tattggccat tagccatatt attcattggt tatatagcat aaattaatat 60 

tggctattgg ccattgcata cgttgtatct atatcataat atgtacattt atattggctc 120 

atgtccaata tgaccgccat gttggcattg attattgact agttattaat agtaatcaat 180 

tacggggtca ttagttcata gcccatatat ggagttccgc gttacataac ttacggtaaa 240 

tggcccgcct ggctgaccgc ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt 300 

tcccatagta acgcaaatag ggattttcca ttaacgtcaa tgggtggagt atttacggta 360 
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aactgcccac ttggcagtac atcaagtgta tcatatgcca agtccgcccc ctattgacgt 420 

caatgacggt aaatggcccg cctggcatta tgcccagtac atgactttat gggattttcc 4 80 

tatttggcag tacatctacg tattagtcat cgctattacc atggtgatgc ggttttggca 540 

gtacaccaat gggcgtggat agcggtttga ctcacgggga tttccaagtc tccaccccat 600 

tgacgtcaat gggagtttgt tttggcacca aggtaaaagg gattttccaa aatgtcgtaa 660 

caactgcgat cgcccgcccc gttgacgcaa atgggcggta ggcgtgtacg gtgggaggtt 720 

tatataagca gagctcgttt agtgaaccgt cagatcaagc tt 762 



<210> 47 
<211> 762 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered promoter construct 



<400> 47 

ggtacctcaa tattggccat tagccatatt attcattggt tatatagcat aaattaatat 60 

tggctattgg ccattgcata cgttgtatct atatcataat atgtacattt atattggctc 120 

atgtccaata tgaccgccat gttggcattg attattgact agttattaat agtaatcaat 180 

tacggggtca ttagttcata gcccatatat ggagttccgc gttacataac ttacggtaaa 240 

tggcccgcct ggctgaccgc ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt 300 

tcccatagta acgcaaatat tcccgggaaa ttaacgtcaa tgggtggagt atttacggta 360 

aactgcccac ttggcagtac atcaagtgta tcatatgcca agtccgcccc ctattgacgt 420 

caatgacggt aaatggcccg cctggcatta tgcccagtac atgactttat tctcgaggaa 480 

tatttggcag tacatctacg tattagtcat cgctattacc atggtgatgc ggttttggca 540 

gtacaccaat gggcgtggat agcggtttga ctcacgggga tttccaagtc tccaccccat 600 

tgacgtcaat gggagtttgt tttggcacca aggtaaaatt acgcgtaaaa aatgtcgtaa 660 

caactgcgat cgcccgcccc gttgacgcaa atgggcggta ggcgtgtacg gtgggaggtt 720 

gctagccgca gagctcgttt agtgaaccgt cagatcaagc tt 762 



<210> 48 
<211> 762 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered promoter construct 



<400> 48 

ggtacctcaa tattggccat tagccatatt attcattggt tatatagcat aaatcaatat 60 

tggctattgg ccattgcata cgttgtatct atatcataat atgtacattt atattggctc 120 

atgtccaata tgaccgccat gttggcattg attattgact agttattaat agtaatcaat 180 

tacggggtca ttagttcata gcccatatat ggagttccgc gttacataac ttacggtaaa 240 

tggcccgcct ggctgaccgc ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt 300 

tcccatagta acgccaatag ggactttcca ttgacgtcaa tgggtggagt atttacggta 360 

aactgcccac ttggcagtac atcaagtgta tcatatgcca agtccgcccc ctattgacgt 420 

caatgacggt aaatggcccg cctggcatta tgcccagtac atgaccttac gggactttcc 480 

tacttggcag tacatctacg tattagtcat cgctattacc atggtgatgc ggttttggca 540 

gtacaccaat gggcgtggat agcggtttga ctcacgggga tttccaagtc tccaccccat 600 

tgacgtcaat gggagtttgt tttggcacca aaatcaacgg gactttccaa aatgtcgtaa 660 

caactgcgat cgcccgcccc gttgacgcaa atgggcggta ggcgtgtacg gtgggaggtc 720 

tatataagca gagctcgttt agtgaaccgt cagatcaagc tt 762 



<210> 49 
<211> 12 
<212> DNA 
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<213> Artificial Sequence 



<220> 

<223> wild type regulatory sequence 
<400> 49 

gactgtttgt tt 12 

<210> 50 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> wild type regulatory sequence 
<400> 50 

aggactcttg ga 12 

<210> 51 
<211> 46 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> wild type regulatory sequence 
<400> 51 

tactaggagg ctgtaggcat aaattggtct gcgcaccagc accatg 46 

<210> 52 
<211> 46 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered regulatory sequence 
<400> 52 

tactaggagg ctgtaggcat aaattagtct gcgcaccagc accatg 46 

<210> 53 
<211> 46 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered regulatory sequence 
<400> 53 

tactaggatt agtgcttaag cccttggtct gcgcaccagc accatg 46 

<210> 54 
<211> 46 
<212> DNA 

<213> Artificial Sequence 
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J 



<220> 

<223> engineered regulatory sequence 



<400> 54 

tactaggagg ctgtaggcat aaagctcgag tatacaacgc accatg 46 

<210> 55 
<211> 50 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered regulatory sequence 
<400> 55 

tactaggagg ctgtaggcat aaatgcgtaa aagcaccagc accatgcaac 50 

<210> 56 
<211> 50 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered regulatory sequence 
<400> 56 

tactaggagg ctgtaggcat aaattaaaaa acgcaccagc accatgcaac 50 

<210> 57 
<211> 50 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered regulatory sequence 
<400> 57 

tactaggagg ctgtaggcat aaattaatcc gcgcaccagc accatgcaac 50 

<210> 58 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered regulatory sequence 
<400> 58 

accttgaggc atacttcaaa gactgttgat ttagcgaata agaggagttg g 51 

<210> 59 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered regulatory sequence 
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<400> 59 

accttgaggc atacttcaaa gactgtttat tttaataacg ggaggagttg g 



51 



<210> 60 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered regulatory sequence 
<400> 60 

accttgaggc atacttcaaa gactgtttat ttaaggactg ggaggagttg g 51 

<210> 61 
<211> 6513 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> heterologous nucleic acid construct 



<400> 61 

tcaatattgg ccattagcca tattattcat tggttatata gcataaatca atattggcta 60 

ttggccattg catacgttgt atctatatca taatatgtac atttatattg gctcatgtcc 120 

aatatgaccg ccatgttggc attgattatt gactagttat taatagtaat caattacggg 180 

gtcattagtt catagcccat atatggagtt ccgcgttaca taacttacgg taaatggccc 240 

gcctggctga ccgcccaacg acccccgccc attgacgtca ataatgacgt atgttcccat 300 

agtaacgcca atagggactt tccattgacg tcaatgggtg gagtatttac ggtaaactgc 360 

ccacttggca gtacatcaag tgtatcatat gccaagtccg ccccctattg acgtcaatga 420 

cggtaaatgg cccgcctggc attatgccca gtacatgacc ttacgggact ttcctacttg 480 

gcagtacatc tacgtattag tcatcgctat taccatggtg atgcggtttt ggcagtacac 540 

caatgggcgt ggatagcggt ttgactcacg gggatttcca agtctccacc ccattgacgt 600 

caatgggagt ttgttttggc accaaaatca acgggacttt ccaaaatgtc gtaacaactg 660 

cgatcgcccg ccccgttgac gcaaatgggc ggtaggcgtg tacggtggga ggtctatata 720 

agcagagctc gtttagtgaa ccgtcagatc actagaagct ttattgcggt agtttatcac 780 

agttaaattg ctaacgcagt cagtgcttct gacacaacag tctcgaactt aagctgcagt 840 

gactctctta aggtagcctt gcagaagttg gtcgtgaggc actgggcagg taagtatcaa 900 

ggttacaaga caggtttaag gagaccaata gaaactgggc ttgtcgagac agagaagact 960 

cttgcgtttc tgataggcac ctattggtct tactgacatc cactttgcct ttctctccac 1020 

aggtgtccac tcccagttca attacagctc ttaaggctag agtacttaat acgactcact 1080 

ataggctagc cagcttgaag caagcctcct gaaagatgga ggcgtcgctg ccggcccagg 1140 

ccgccgagac ggaggaggtg ggtcttttcg tcgaaaaata cctccggtcc gatgtcgcgc 1200 

cggcggaaat tgtcgcgctc atgcgcaacc tcaacagcct gatgggacgc acgcggttta 1260 

tttacctggc gttgctggag gcctgtctcc gcgttcccat ggccacccgc agcagcgcca 1320 

tatttcggcg gatctatgac cactacgcca cgggcgtcat ccccacgatc aacgtcaccg 1380 

gagagctgga gctcgtggcc ctgcccccca ccctgaacgt aacccccgtc tgggagctgt 1440 

tgtgcctgtg cagcaccatg gccgcgcgcc tgcattggga ctcggcggcc gggggatctg 1500 

ggaggacctt cggccccgat gacgtgctgg acctactgac cccccactac gaccgctaca 1560 

tgcagctggt gttcgaactg ggccactgta acgtaaccga cggacttctg ctctcggagg 1620 

aagccgtcaa gcgcgtcgcc gacgccctaa gcggctgtcc cccgcgcggg tccgttagcg 1680 

agacggacca cgcggtggcg ctgttcaaga taatctgggg cgaactgttt ggcgtgcaga 1740 

tggccaaaag cacgcagacg tttcccgggg cggggcgcgt taaaaacctc accaaacaga 1800 

caatcgtggg gttgttggac gcccaccaca tcgaccacag cgcctgccgg acccacaggc 1860 

agctgtacgc cctgcttatg gcccacaagc gggagtttgc gggcgcgcgc ttcaagctac 1920 

gcgtgcccgc gtgggggcgc tgtttgcgca cgcactcatc cagcgccaac cccaacgctg 1980 
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acatcatcct ggaggcggcg ctgtcggagc tccccaccga ggcctggccc atgatgcagg 2040 

gggcggtgaa ctttagcacc ctaatgaagc tactgtcttc tatcgaacaa gcatgcccaa 2100 

aaaagaagag aaaggtagat gaattcccgg ggatctcgac ggcccccccg accgatgtca 2160 

gcctggggga cgagctccac ttagacggcg aggacgtggc gatggcgcat gccgacgcgc 2220 

tagacgattt cgatctggac atgttggggg acggggattc cccgggtccg ggatcgccag 2280 

ggatccgtcg acttgacgcg ttgatatcat ctagagcggc cgcaggtacc tgaataacta 2340 

aggccgcttc cctttagtga gggttaatgc ttcgagcaga catgataaga tacattgatg 2400 

agtttggaca aaccacaact agaatgcagt gaaaaaaatg ctttatttgt gaaatttgtg 2460 

atgctattgc tttatttgta accattataa gctgcaataa acaagttaac aacaacaatt 2520 

gcattcattt tatgtttcag gttcaggggg agatgtggga ggttttttaa agcaagtaaa 2580 

acctctacaa atgtggtaaa atccgataag gatcgattcc ggagcctgaa tggcgaatgg 2640 

acgcgccctg tagcggcgca ttaagcgcgg cgggtgtggt ggttacgcgc acgtgaccgc 2700 

tacacttgcc agcgccctag cgcccgctcc tttcgctttc ttcccttcct ttctcgccac 2760 

gttcgccggc tttccccgtc aagctctaaa tcgggggctc cctttagggt tccgatttag 2820 

tgctttacgg cacctcgacc ccaaaaaact tgattagggt gatggttcac gtagtgggcc 2880 

atcgccctga tagacggttt ttcgcccttt gacgttggag tccacgttct ttaatagtgg 2940 

actcttgttc caaactggaa caacactcaa ccctatctcg gtctattctt ttgatttata 3000 

agggattttg ccgatttcgg cctattggtt aaaaaatgag ctgatttaac aaaaatttaa 3060 

cgcgaatttt aacaaaatat taacgcttac aatttcgcct gtgtaccttc tgaggcggaa 3120 

agaaccagct gtggaatgtg tgtcagttag ggtgtggaaa gtccccaggc tccccagcag 3180 

gcagaagtat gcaaagcatg catctcaatt agtcagcaac caggtgtgga aagtccccag 3240 

gctccccagc aggcagaagt atgcaaagca tgcatctcaa ttagtcagca accatagtcc 3300 

cgcccctaac tccgcccatc ccgcccctaa ctccgcccag ttccgcccat tctccgcccc 3360 

atggctgact aatttttttt atttatgcag aggccgaggc cgcctcggcc tctgagctat 3420 

tccagaagta gtgaggaggc ttttttggag gcctaggctt ttgcaaaaag cttgattctt 3480 

ctgacacaac agtctcgaac ttaaggctag agccaccatg attgaacaag atggattgca 3540 

cgcaggttct ccggccgctt gggtggagag gctattcggc tatgactggg cacaacagac 3600 

aatcggctgc tctgatgccg ccgtgttccg gctgtcagcg caggggcgcc cggttctttt 3660 

tgtcaagacc gacctgtccg gtgccctgaa tgaactgcag gacgaggcag cgcggctatc 3720 

gtggctggcc acgacgggcg ttccttgcgc agctgtgctc gacgttgtca ctgaagcggg 37 80 

aagggactgg ctgctattgg gcgaagtgcc ggggcaggat ctcctgtcat ctcaccttgc 3840 

tcctgccgag aaagtatcca tcatggctga tgcaatgcgg cggctgcata cgcttgatcc 3900 

ggctacctgc ccattcgacc accaagcgaa acatcgcatc gagcgagcac gtactcggat 3960 

ggaagccggt cttgtcgatc aggatgatct ggacgaagag catcaggggc tcgcgccagc 4020 

cgaactgttc gccaggctca aggcgcgcat gcccgacggc gaggatctcg tcgtgaccca 4 080 

tggcgatgcc tgcttgccga atatcatggt ggaaaatggc cgcttttctg gattcatcga 414 0 

ctgtggccgg ctgggtgtgg cggaccgcta tcaggacata gcgttggcta cccgtgatat 4200 

tgctgaagag cttggcggcg aatgggctga ccgcttcctc gtgctttacg gtatcgccgc 4260 

tcccgattcg cagcgcatcg ccttctatcg ccttcttgac gagttcttct gagcgggact 4320 

ctggggttcg aaatgaccga ccaagcgacg cccaacctgc catcacgatg gccgcaataa 4380 

aatatcttta ttttcattac atctgtgtgt tggttttttg tgtgaagatc cgcgtatggt 4440 

gcactctcag tacaatctgc tctgatgccg catagttaag ccagccccga cacccgccaa 4500 

cacccgctga cgcgccctga cgggcttgtc tgctcccggc atccgcttac agacaagctg 4560 

tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga 4620 

gacgaaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt 4680 

cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt 4740 

tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat 4800 

aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt 4860 

ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg 4920 

ctgaagatca gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga 4 980 

tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc 5040 

tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac 5100 

actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg 5160 

gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca 5220 

acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg 5280 

gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg 5340 

acgagcgtga caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg 5400 
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gcgaactact tactctagct tcccggcaac aattaataga ctggatggag gcggataaag 5460 

ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg 5520 

gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct 5580 

cccgtatcgt agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac 5640 

agatcgctga gataggtgcc tcactgatta agcattggta actgtcagac caagtttact 5700 

catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga 5760 

tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt 5820 

cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct 5880 

gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc 5940 

taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc 6000 

ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc 6060 

tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg 6120 

ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt 6180 

cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg 6240 

agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 6300 

gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 6360 

atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 6420 

gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 6480 

gctggccttt tgctcacatg gctcgacaga tct 6513 



<210> 62 
<211> 6439 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> heterologous nucleic acid construct 



<400> 62 

tcaatattgg ccattagcca tattattcat tggttatata gcataaatca atattggcta 60 

ttggccattg catacgttgt atctatatca taatatgtac atttatattg gctcatgtcc 120 

aatatgaccg ccatgttggc attgattatt gactagttat taatagtaat caattacggg 180 

gtcattagtt catagcccat atatggagtt ccgcgttaca taacttacgg taaatggccc 240 

gcctggctga ccgcccaacg acccccgccc attgacgtca ataatgacgt atgttcccat 300 

agtaacgcca atagggactt tccattgacg tcaatgggtg gagtatttac ggtaaactgc 360 

ccacttggca gtacatcaag tgtatcatat gccaagtccg ccccctattg acgtcaatga 420 

cggtaaatgg cccgcctggc attatgccca gtacatgacc ttacgggact ttcctacttg 480 

gcagtacatc tacgtattag tcatcgctat taccatggtg atgcggtttt ggcagtacac 540 

caatgggcgt ggatagcggt ttgactcacg gggatttcca agtctccacc ccattgacgt 600 

caatgggagt ttgttttggc accaaaatca acgggacttt ccaaaatgtc gtaacaactg 660 

cgatcgcccg ccccgttgac gcaaatgggc ggtaggcgtg tacggtggga ggtctatata 720 

agcagagctc gtttagtgaa ccgtcagatc actagaagct ttattgcggt agtttatcac 780 

agttaaattg ctaacgcagt cagtgcttct gacacaacag tctcgaactt aagctgcagt 840 

gactctctta aggtagcctt gcagaagttg gtcgtgaggc actgggcagg taagtatcaa 900 

ggttacaaga caggtttaag gagaccaata gaaactgggc ttgtcgagac agagaagact 960 

cttgcgtttc tgataggcac ctattggtct tactgacatc cactttgcct ttctctccac 1020 

aggtgtccac tcccagttca attacagctc ttaaggctag agtacttaat acgactcact 1080 

ataggctagc cagcttgaag caagcctcct gaaagatgga ggcgtcgctg ccggcccagg 1140 

ccgccgagac ggaggaggtg ggtcttttcg tcgaaaaata cctccggtcc gatgtcgcgc 1200 

cggcggaaat tgtcgcgctc atgcgcaacc tcaacagcct gatgggacgc acgcggttta 12 60 

tttacctggc gttgctggag gcctgtctcc gcgttcccat ggccacccgc agcagcgcca 1320 

tatttcggcg gatctatgac cactacgcca cgggcgtcat ccccacgatc aacgtcaccg 1380 

gagagctgga gctcgtggcc ctgcccccca ccctgaacgt aacccccgtc tgggagctgt 1440 

tgtgcctgtg cagcaccatg gccgcgcgcc tgcattggga ctcggcggcc gggggatctg 1500 

ggaggacctt cggccccgat gacgtgctgg acctactgac cccccactac gaccgctaca 1560 

tgcagctggt gttcgaactg ggccactgta acgtaaccga cggacttctg ctctcggagg 1620 

aagccgtcaa gcgcgtcgcc gacgccctaa gcggctgtcc cccgcgcggg tccgttagcg 1680 
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agacggacca cgcggtggcg ctgttcaaga taatctgggg cgaactgttt ggcgtgcaga 1740 

tggccaaaag cacgcagacg tttcccgggg cggggcgcgt taaaaacctc accaaacaga 1800 

caatcgtggg gttgttggac gcccaccaca tcgaccacag cgcctgccgg acccacaggc 1860 

agctgtacgc cctgcttatg gcccacaagc gggagtttgc gggcgcgcgc ttcaagctac 1920 

gcgtgcccgc gtgggggcgc tgtttgcgca cgcactcatc cagcgccaac cccaacgctg 1980 

acatcatcct ggaggcggcg ctgtcggagc tccccaccga ggcctggccc atgatgcagg 204 0 

gggcggtgaa ctttagcacc ctaccaaaaa agaagagaaa ggtagatcgg acactggtga 2100 

ccttcaagga tgtatttgtg gacttcacca gggaggagtg gaagctgctg gacactgctc 2160 

agcagatcgt gtacagaaat gtgatgctgg agaactataa gaacctggtt tccttgggtt 2220 

attgatgaga tatcatctag agcggccgca ggtacctgaa taactaaggc cgcttccctt 2280 

tagtgagggt taatgcttcg agcagacatg ataagataca ttgatgagtt tggacaaacc 2340 

acaactagaa tgcagtgaaa aaaatgcttt atttgtgaaa tttgtgatgc tattgcttta 2400 

tttgtaacca ttataagctg caataaacaa gttaacaaca acaattgcat tcattttatg 2460 

tttcaggttc agggggagat gtgggaggtt ttttaaagca agtaaaacct ctacaaatgt 2520 

ggtaaaatcc gataaggatc gattccggag cctgaatggc gaatggacgc gccctgtagc 2580 

ggcgcattaa gcgcggcggg tgtggtggtt acgcgcacgt gaccgctaca cttgccagcg 2640 

ccctagcgcc cgctcctttc gctttcttcc cttcctttct cgccacgttc gccggctttc 2700 

cccgtcaagc tctaaatcgg gggctccctt tagggttccg atttagtgct ttacggcacc 2760 

tcgaccccaa aaaacttgat tagggtgatg gttcacgtag tgggccatcg ccctgataga 2820 

cggtttttcg ccctttgacg ttggagtcca cgttctttaa tagtggactc ttgttccaaa 2880 

ctggaacaac actcaaccct atctcggtct attcttttga tttataaggg attttgccga 2940 

tttcggccta ttggttaaaa aatgagctga tttaacaaaa atttaacgcg aattttaaca 3000 

aaatattaac gcttacaatt tcgcctgtgt accttctgag gcggaaagaa ccagctgtgg 3060 

aatgtgtgtc agttagggtg tggaaagtcc ccaggctccc cagcaggcag aagtatgcaa 3120 

agcatgcatc tcaattagtc agcaaccagg tgtggaaagt ccccaggctc cccagcaggc 3180 

agaagtatgc aaagcatgca tctcaattag tcagcaacca tagtcccgcc cctaactccg 3240 

cccatcccgc ccctaactcc gcccagttcc gcccattctc cgccccatgg ctgactaatt 3300 

ttttttattt atgcagaggc cgaggccgcc tcggcctctg agctattcca gaagtagtga 3360 

ggaggctttt ttggaggcct aggcttttgc aaaaagcttg attcttctga cacaacagtc 3420 

tcgaacttaa ggctagagcc accatgattg aacaagatgg attgcacgca ggttctccgg 3480 

ccgcttgggt ggagaggcta ttcggctatg actgggcaca acagacaatc ggctgctctg 3540 

atgccgccgt gttccggctg tcagcgcagg ggcgcccggt tctttttgtc aagaccgacc 3600 

tgtccggtgc cctgaatgaa ctgcaggacg aggcagcgcg gctatcgtgg ctggccacga 3660 

cgggcgttcc ttgcgcagct gtgctcgacg ttgtcactga agcgggaagg gactggctgc 3720 

tattgggcga agtgccgggg caggatctcc tgtcatctca ccttgctcct gccgagaaag 3780 

tatccatcat ggctgatgca atgcggcggc tgcatacgct tgatccggct acctgcccat 3840 

tcgaccacca agcgaaacat cgcatcgagc gagcacgtac tcggatggaa gccggtcttg 3900 

tcgatcagga tgatctggac gaagagcatc aggggctcgc gccagccgaa ctgttcgcca 3960 

ggctcaaggc gcgcatgccc gacggcgagg atctcgtcgt gacccatggc gatgcctgct 4020 

tgccgaatat catggtggaa aatggccgct tttctggatt catcgactgt ggccggctgg 4080 

gtgtggcgga ccgctatcag gacatagcgt tggctacccg tgatattgct gaagagcttg 414 0 

gcggcgaatg ggctgaccgc ttcctcgtgc tttacggtat cgccgctccc gattcgcagc 4200 

gcatcgcctt ctatcgcctt cttgacgagt tcttctgagc gggactctgg ggttcgaaat 4260 

gaccgaccaa gcgacgccca acctgccatc acgatggccg caataaaata tctttatttt 4320 

cattacatct gtgtgttggt tttttgtgtg aagatccgcg tatggtgcac tctcagtaca 4380 

atctgctctg atgccgcata gttaagccag ccccgacacc cgccaacacc cgctgacgcg 4440 

ccctgacggg cttgtctgct cccggcatcc gcttacagac aagctgtgac cgtctccggg 4500 

agctgcatgt gtcagaggtt ttcaccgtca tcaccgaaac gcgcgagacg aaagggcctc 4560 

gtgatacgcc tatttttata ggttaatgtc atgataataa tggtttctta gacgtcaggt 4620 

ggcacttttc ggggaaatgt gcgcggaacc cctatttgtt tatttttcta aatacattca 4 680 

aatatgtatc cgctcatgag acaataaccc tgataaatgc ttcaataata ttgaaaaagg 4740 

aagagtatga gtattcaaca tttccgtgtc gcccttattc ccttttttgc ggcattttgc 4800 

cttcctgttt ttgctcaccc agaaacgctg gtgaaagtaa aagatgctga agatcagttg 4860 

ggtgcacgag tgggttacat cgaactggat ctcaacagcg gtaagatcct tgagagtttt 4920 

cgccccgaag aacgttttcc aatgatgagc acttttaaag ttctgctatg tggcgcggta 4980 

ttatcccgta ttgacgccgg gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat 5040 

gacttggttg agtactcacc agtcacagaa aagcatctta cggatggcat gacagtaaga 5100 
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gaattatgca 
acgatcggag 
cgccttgatc 
acgatgcctg 
ctagcttccc 
ctgcgctcgg 
gggtctcgcg 
atctacacga 
ggtgcctcac 
attgatttaa 
ctcatgacca 
aagatcaaag 
aaaaaaccac 
ccgaaggtaa 
tagttaggcc 
ctgttaccag 
cgatagttac 
agcttggagc 
gccacgcttc 
ggagagcgca 
tttcgccacc 
tggaaaaacg 
cacatggctc 



gtgctgccat 
gaccgaagga 
gttgggaacc 
tagcaatggc 
ggcaacaatt 
cccttccggc 
gtatcattgc 
cggggagtca 
tgattaagca 
aacttcattt 
aaatccctta 
gatcttcttg 
cgctaccagc 
ctggcttcag 
accacttcaa 
tggctgctgc 
cggataaggc 
gaacgaccta 
ccgaagggag 
cgagggagct 
tctgacttga 
ccagcaacgc 
gacagatct 



aaccatgagt 
gctaaccgct 
ggagctgaat 
aacaacgttg 
aatagactgg 
tggctggttt 
agcactgggg 
ggcaactatg 
ttggtaactg 
ttaatttaaa 
acgtgagttt 
agatcctttt 
ggtggtttgt 
cagagcgcag 
gaactctgta 
cagtggcgat 
gcagcggtcg 
caccgaactg 
aaaggcggac 
tccaggggga 
gcgtcgattt 
ggccttttta 



gataacactg 
tttttgcaca 
gaagccatac 
cgcaaactat 
atggaggcgg 
attgctgata 
ccagatggta 
gatgaacgaa 
tcagaccaag 
aggatctagg 
tcgttccact 
tttctgcgcg 
ttgccggatc 
ataccaaata 
gcaccgccta 
aagtcgtgtc 
ggctgaacgg 
agatacctac 
aggtatccgg 
aacgcctggt 
ttgtgatgct 
cggttcctgg 



cggccaactt 
acatggggga 
caaacgacga 
taactggcga 
ataaagttgc 
aatctggagc 
agccctcccg 
atagacagat 
tttactcata 
tgaagatcct 
gagcgtcaga 
taatctgctg 
aagagctacc 
ctgtccttct 
catacctcgc 
ttaccgggtt 
ggggttcgtg 
agcgtgagct 
taagcggcag 
atctttatag 
cgtcaggggg 
ccttttgctg 



acttctgaca 
tcatgtaact 
gcgtgacacc 
actacttact 
aggaccactt 
cggtgagcgt 
tatcgtagtt 
cgctgagata 
tatactttag 
ttttgataat 
ccccgtagaa 
cttgcaaaca 
aactcttttt 
agtgtagccg 
tctgctaatc 
ggactcaaga 
cacacagccc 
atgagaaagc 
ggtcggaaca 
tcctgtcggg 
gcggagccta 
gccttttgct 



5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6439 



<210> 63 

<211> 77 

<212> PRT 

<213> Herpes simplex virus type 2 



<400> 63 



Thr 


Ala 


Pro 


He 


Thr Asp Val 


Ser 


Leu Gly Asp 


Glu 


Leu Arg 


Leu 


Asp 


1 








5 




10 






15 




Gly Glu 


Glu 


Val 


Asp Met Thr 


Pro 


Ala Asp Ala 


Leu 


Asp Asp 


Phe 


Asp 








20 






25 




30 






Leu 


Glu 


Met 


Leu 


Gly Asp Val 


Glu 


Ser Pro Ser 


Pro 


Gly Met 


Thr 


His 






35 






40 






45 






Asp 


Pro 


Val 


Ser 


Tyr Gly Ala 


Leu Asp Val Asp 


Asp 


Phe Glu 


Phe 


Glu 




50 






55 






60 








Gin 


Met 


Phe 


Thr Asp Ala Leu 


Gly 


He Asp Asp 


Phe 


Gly 







65 70 75 



<210> 64 

<211> 44 

<212> PRT 

<213> Herpes simplex virus type 2 



<400> 64 

Ala Asp Ala Leu Asp Asp Phe Asp 
1 5 
Asp Phe Asp Leu Glu Met Ala Asp 
20 

Met Ala Asp Ala Leu Asp Asp Phe 
35 40 



Leu Glu Met Ala Asp Ala Leu Asp 

10 15 
Ala Leu Asp Asp Phe Asp Leu Glu 
25 30 
Asp Leu Glu Met 



<210> 65 
<211> 10 
<212> DNA 
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<213> 



Artificial Sequence 



<220> 




<223> 


engineered 


<400> 


65 


actttatttt 


<210> 


66 


<211> 


10 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


engineered 


<400> 


66 


gagtttttcc 


<210> 


67 


<211> 


10 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


engineered 


<400> 


67 


gatgggattt 


<210> 


68 


<211> 


10 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


engineered 


<400> 


68 


tctttttgtt 


<210> 


69 


<211> 


10 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


engineered 


<400> 


69 


gagttggcgg 


<210> 


70 


<211> 


10 


<212> 


DNA 


<213> 


Artificial 



10 



10 



10 



10 



10 
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<220> 

<223> engineered DNA response element 



<400> 


70 


tctggttgtt 


<210> 


71 


<211> 


10 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


engineered 


<400> 


71 


gagttttgtt 


<210> 


72 


<211> 


12 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


engineered 


<400> 


72 


ccagggcccc ga 


<210> 


73 


<211> 


12 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


engineered 


<400> 


73 


gccgcggtct gt 


<210> 


74 


<211> 


12 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


engineered 


<400> 


74 


cgtccgcggt ga 


<210> 


75 


<211> 


12 


<212> 


DNA 


<213> 


Artificial 



10 



10 



12 



12 



12 



<220> 

<223> engineered DNA response element 
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<400> 75 
tttacttatt tt 

<210> 76 

<211> 7 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> engineered DNA response element 

<400> 76 
gagtttt 

<210> 77 

<211> 9 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> sequence complementary to SEQ ID No 



<400> 77 
aaaacttta 



